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METHOD TOR DETERMINING THE DROP 
RATE, THE TRANSIT DELAY AND THE 
BREAK STATE OP COMMUNICATIONS 
OBJECTS 

FIELD OF THE INVENTION 

This method determines the drop rate, tbc transit delay 
and the fertile stale or communications objects using the 
topology (connectivity) of these objects. 

BACKGROUND TO THE INVENTION 

Existing methods for detenmniog whether Or not a com* 
municatious device is broken depend on periodically send* 
ing frames to it which require the device to respond (e.g. 
SNMP requests and responses (RFC 1157)). The absence of 
any response to a sequence of requests indicates tbc device 
is either broken or that the conimurucatioos path to the 
device is broken. Toe best method for exploiting this infor- 
mation using knowledge of the oerwork topology is reported 
by Dawes et a! (Network Diagnosis by Reasoning in Uncer- 
tain Nested Evidence Spaces: N. W. Dawes, J. Allot}, B. 
Pagurck; IEEE Transactions on Communications, WZ, 43, pp 
466-476, 1995). This earlier method docs not exploit mea- 
surements of me traffic rates on lines connected to devices 
and so is far more complex and far later to detect break faults 
thin the method described below. It also is marginally less 
accurate. Commercially deployed break fault methods are 
very significantly inferior to even this previous method. 

Existing methods for deteonming the transit delay across 
a device rely on requesting this information from, the device 
itself, in tbc case where tbc device measures this delay and 
records U so it can be read externally. However, many 
devices do not have these facilities. Many of those that do, 
do so in a manner which fc particular to thai version of that 
manufacturer's device, placing the information hi certain 
variables somewhere in the MIR (RFC 3213), This makes 
the process of dcicrminicg (he transit delay across a device 
cumbersome and complex, as variation need to be made for 
tbc particular device type. 

Existing methods for determining the drop rate of a device 
depend on what percentage of responses it makes to man- 
agement requests. Tbey do not use knowledge of the local 
topology of objects and so are far less accurate than the 
present invention. 

SUMMARY OF THE INVENTION 

A method of determining the topology of a network of 
objects has been filed for patent* Dawes et *1* U£. Ser. No. 
08^55,729 filed Nov. 16, 1995, VS. Ser. No. 067599310 
filed Feb. 9, 1996 and (unknown) filed Nov, 15, 1996 
incorporated heron by reference. A manual method or some 
alternative automatic method, allows the connectivity of 
communications objects to be determined, 

A new method described below also work£ on unman aged 55 
objects and sets of unmanaged objects, which is novel. 

The invention exploits knowledge of the detailed local 
topology of communicating objects. 

DEFINITIONS 

60 

Communications objects such as routers have multiple 
communications lines. They accept frames from these lines 
and determine from inJbrmaijou in each frame which V*** 
each frame should be sent out on. 

Transit delay: & 

Use time between the receipt of a frame and it? despatch 
Out again is called the traa&it delay. 
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Drop rate: 

Sometimes routing or switching eooimutucaiions devices 
cannot dispatch frames as fast as they receive them and run 
out of memory to store the ones they receive, so they discard 
$ some. In addition, internal queues may fill up and for other 
reasons, frames get lost be twee a acceptance and onward 
dispatch. The overall discard rate is usually called the drop 
rate. 
Brealq 

1 0 C ommnnira tions devices, routing or otherwise, can break. 
The break state for a device is true when it can neither send 
nor receive on any communications line, yet all toe lines arc 
ok. For example, when a device is powered down its break 
state is true. The break state is true for a line when the 
ts devices at each end are not broken and yet cannot send or 
receive traffic across it. For example, a line is broken when 
it is cut through. 
NMC: 

The network management ccotci is the computer which is 
2Q operating the software that performs this method. It also 
either performs interrogaiioD of devices to provide data for 
the method below or receives such data to use in the meihod. 

The NMC periodically requests from each device in a 
commu nira hYms network the amount of tr&iiic flowing in 
25 and out of each interface and the line status (OK or OFF) on 
me line for each interface on thai device. This requesT should 
result in a set of replies from each device returned to the 
NMC Not all devices need report the OK or OFF line status 
values or do so correctly. 

If a device breaks then tbc NMC may detect four changes, 
first that it now receives no replies to its requests of this 
device. Second thai it receives no replies from devices lying 
beyond this device and which are only reachable through 
this device. Third no traffic will now be detected flowing in 
any lines to or from this device. Four the line status bits on 
linos connected to this broken device will change (e.g. from 
ok to off). Any subset of two or more of these four changes 
will be adequate to determine that tbc device is broken. 

If a line between two devices is broken, rbe status bits on 
the interface* at each end may change and rm traffic will 
flow. Should neither device be broken tben and yet should 
either of these conditions be met, then the line itself is 
broken. This diagnosis depends on the device break diag- 
nosis above. 

The drop rale in a device is the difference between the 
mean drop rate measured to devices just beyond it (and 
connected to it) and the mean drop rate measured to devices 
just before it (and connected to it), where closeness is 
measured in terms of the number of hops to the NMC. 
Devices diagnosed as broken should not be included in any 
part of this calculation. 

The mean frame transit delay M a device is the differ ence 
between the mean round trip time measured to devices just 
beyond it (and connected to it) and the mean round trip time 
measured to devices just before it (and connected to it), 
where closeness is measured in terms of the number of hops 
to the NMC Devices diagnosed as broken should not be 
included in any part of this calculation. 

The result is far simpler and far more generally applicable 
method which fives similar or better results. This means that 
all the devices in communications networks can now be 
analyzed, without any undue burdec on the network band- 
width or in machine feciUlies. 

Tu accordance with an embodiment of the Invention, a 
meihod for Determining the mean transit delay of frames 
through one or mors communications devices which receive 
and forward frames. 
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la accordance with, toother embodn&ect, ft method for 
determining the mean drop rate of frames through one or 
more communkatioos devices which receive and forward 
frames. 

In accordance with another embodiment, a method for s 
determining the break state of one or more communications 
device^ tod interfaces or lines to and from ooquaunicfttxofis 
devices, 

In accordance with another embodiment, a method of 
analyzing a communication network comprising deJermin- 10 
tng a mean drop rale in a device x by polling each device 
from a network management computer (NMQ which is in 
communication with the network, and processing signals in 
the NMC to determine a drop rate D(x), in accordance with: 

and Ux)-a-A00 

where 

A(x): the fraction of poll requests from tbeNMCto device 20 
x for widen (he NMC receives replies (measured over 
the last M sampling periods), (wherein device x must 
not be broken), 

D(x): toe mean frame drop rate in device x, 

Uc): NMC* perception of the loss rate to device x and u 
back, 

t^-(x); the NMC** perception of toe mean value of L(z) 
wt all devices z connected to device x, closer to the 
NMC than device x and which aic not broken, and M 
La(x); the NMC's perception of the rrjcaa value of L(z) 
for all devioes z connected to device x, further away 
from the NMC man device * and which are not broken. 
In accordance with another embodiment, a method of 
analyzing a communication network comprising detcrmin- 35 
ing a mean frame transit delay in a device j. by polling each 
deviae from a network management computer (NMC) which 
is in communication with the network and processing sig- 
nals in the NMC to determine a transit delay T(x) in 
accordance with the process: *q 

where 

T(x): the mean frame transit delay for device 1, (wherein 

device x must not be broken), 45 
W(x> the mean round trip time taken between a poll 
request from the NMC to device x and the receipt of the 
reply by the NMC (measured over toe last N sampling 
periods), w 
w-(x): The NMC's perception of the mean value of Wf» 
far all devices 2 connected to device x, closer to the 
NMC than device x and which are not broken, 
W+(x): The NMCs perceptioo of the mean value of W(») 
for all devices z connected to device x, further away $> 
finum the NMC than device x and which are not broken. 
In accordance with another embodiment, a method of 
analyzing a communication network comprising determin- 
ing a break State of communications devices connected in 
the network,, by polling each device from a network man- 60 
agement computer (NMC) which ts in communication with 
the network, and processing signals in (be NMC in accor- 
dance with at least one of 
(a) (i) receiving no replies to polling signals directed to a 
device, 55 
(ii) receiving no replies from devices lying beyond said 
device, 
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(iii) detecting no traffic flowing in any lines to or from 
said device, 

0 V ) detecting changes to line status bits on lines 
connected to said device; 

(b) (i) determining zero traffic on a line and a device being 
otherwise determined as not being broken, declaring 
the line as being broken* 

(u) declaring a line as hciog broken in step (bXi) after 
a predetermined period of time, 

and 

(c) processing steps (a) and (b) with lines having more 
than two ends, as if it were a single device from the 
point of view of breaks. 

BRIEF INTRODUCTION TO THE DRAWINGS 

A better understanding of the invention will be obtained 
by od Diddering the detailed description below, with refer- 
ence to the following drawings, in which: 
FIG. 1 is an illustration of a portion of a network, and 
FIG. 2 is a block diagram of a Structure for supplementing 
the uxvention. 

DETAILED DESCRIPTION OP PREFERRED 
EMBODIMENTS OF THE INVENTION 

The method dciscribed below us general, is independent of 
device type and does not require a device to respond to 
management requests (eg. SNMP> Moreover, the method 
described below works even on objects or sets of objects not 
responding to management requests (e.g, a portion Of the 
network managed by some supplier of oonzmuoJadions 
services), 

EXAMPLE 

Let a portion of a network be as in FIG. 1. 'D' lies closer 
to the NMC than 'x' and 'C and *B* lie beyond V. In other 
words, 'D' is one bop closer to the NMC than 'x' and X' and 
*B' are one bop beyond V. Let none of the devices be 
broken. 

The drop rate in 'x' is the difference between the mean 
drop me measured to 'C* and *B T and the mean drop rate 
measured to *D\ The mean d/op rate measured to 4 D' is the 
fraction Of the requests for information sent by the NMC to 
*D' to which no replies have been received. 'Joe mean drop 
rates to 'C and *B f are computed similarly. 

The mean frame transit delay *x* H the difference between 
the mean round trip time measured to 'C and 'B' and the 
mean round uip time to *D\ 

Should V now break then replies will no longer be 
received from *x\ *bV and *C\ Simultaneously traffic will 
cease between *D 9 and V and the interface en *D* for the 
line 'D' to V will report a Change from < ok' to *Ofi\ 

The software executing the method runs as a software 
module within the same main software process thai executes 
the metbexb described in the aforenoted patent applications. 
This process receive* device replies from a further Software 
process that periodically requests the traffic and status infor- 
mation from all managed devices in the network. The mam 
software uses these relies to determine the topology, and 
once the topology is known, also passes the replies to the 
logic module that executes the method. Changes in bcenk 
state of any object and the current drop and delay values are 
recorded periodically in a database. The NMC operator can 
now observe these changes in information by operating a 
software tool thai examines mis database. An INTEL PASO 
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cpu with 32 MB of memory audaU Gbyte hard drive W(x): the mesa round trip tm» taken 

required only 0.4% of its Cpu to perform real lime analysis from the NMC to «x' and the receipt of the reply by the 

to execute ihis method on data recorded from every man- NMC (measured over the last N simplmg periods) 

a^^ce evciy three minutes from a commumcationa W-(x): The NMCs perception of the mean vtke ofWfc) 
network with 3,000 «jmmunicaiioiw nodes. Teste on over 5 for a ii Alices -r^JL*^ m 7u!L ^T7^\jxx^ 

10,000 Emulated break* on simulated networks of between K^^hkh^^troln ' 

n^'^E^ m^MS^n^ mean v ahl e of W W 

?T ■ w r*™ 05 °«™ ca ocjow. NMC than 'x* and which are not btofceo. 

To determine the drop rote of communications devices; , n tw- — ... r. *^ JT7 1 ■ I. . J>JW . 

The mean frame rate is the probability th* a frame .° *" mc ***** dday 10 * *vict ^ (he difference 

riTsl^^ to d^nce between W+L W-. « W* . nd W- «fer lo 

u,c «r poiL*,*, &om U* nmc * v for round ** " toonewa ^ 

which the NMC receives replies (measured over the last rtfrtwMfrw-wyi — 3 

M sampling periods), 'x* must be not be broken. M 
D(x>: the mean frame drop rale in device *x\ Example 2 

L-(x); The NMCs perception or tfae mean value otUz) tor ^ 

^S^oS^ ' x '< <;Ios « ta ,bcNMC,llM ' **' W(B>0.100 ix. The NMC gets replies from 'BWerage 
tJ^^ST?^ r*.. u ,,,,, M 0.100 5eoowl*»fter it send* 'B'» request 

^C^'x' ^th 1 iSSK 0l <* after it u^'cTz^L ^ 

Sfd^r^ ,„ , £ k ^ W(I)>0.08U e . The NMC gels «pK« fro* 'D'oo avenge 

Tne drop rate in a device is the difference between the 0 .081 seconds after ii seeds <D { a r«wc« 

mean drop rite measured to devices just beyond it (and M TberefoicT^ ^ 

co n n e cte d to it) and the mean drop rate measured to devices VV -ftrVWftJVO 081 

measured in terms of too number of hops to the NMC Note TrT^^V^/^-Wrr^/^-rn irr>_n CAiyn-n 

that in equation 2 the value of Df» fhslf toe difference ^ ( } ^T^ , . ... 

between^ and as t> and^^To^uS i^Z M OiS^ds ^ ^ ^ ^ X " 

^ 0flB way UJ ' P dro ^ Tod^rmine the break ^tc of cotnmunicartons devices: 

Tbcn?folC - (a) Device breajos. 

If a device breaks then the NMC may detect four changes, 

L(ir>o-A(x) gq a i First that it now receives no replies to lb requests of this 

D(n~QA(d-h-WYl «qa 2 40 dcvriuc - S«™<J that it receiver no replies Crom devices lying 

Examnl 1 beyond this device and which are only reachable thxOugh 

lixampie i this dovice. Third ux> traffic will now detected flowing in any 

Let a portion of the network be as in FtG. 1. fines to or from this device. Fourtb that the line clam* bite 

^}'\ on lines connected to this broken device will change (e.g. 

A(B)-0.95 i^. The NMC gets replies to 9S& of its traffic 45 from ok to off). Any subset of two or more of these four 

wii^^S^f 1 * 21 ^ 001 changes will be adequate to determine that the device is 

A(CHX94 ix. The NMC gets replies to 94% of its traffic broken. 

^^"^ if 10 " ' (r * Should changes be in conflict then the presence of traffic 

A(D)-0.90 i.e. The NMC gets replies to 96% of its traffic to or from a device certainly indicates that device is not 

info requests from 'D\ w broken. 

TJttf** 01 ^ Should an interface line status be reported as OFF when 

H2T 1 a 1 "2 ^"^^ w« flowing on a line, then that meaning of OK and 

HS" 1 "!-^" 0 ' 06 OFF are considered reversed for that interface. 

L(D>1-0.96-0.04 jb) Line breaks (2 ends). 

^ X £MW^W _ _ 55 Sbould a device not be broken and it reports zero traffic 

l&y^HEr^Ef)^* 0 v? on a line and a change from ok to off on the interface staius 

D(x)K^C>L<B))^L(D))/2<0.055-0.04>0.007 and the other end of the line also not be broken, then the line 

Iberefore the mean frame Ins* rate in device 'x' tit 0.007. is declared broken. Note that this categorizes the Jine and the 

To determine the transit delay of eommunicatiocj devices: two interfaces arc being a single unit from the point of view 

Ihe mean frame transit delay is how long it takes the 60 of this diagnosis, 

average frame to transit through this device. Should a hue never have traffic reported on an interface in 

£)cfioc: a device and no sums bit changes be detected, then the line 

M: how many sampling periods (he transit delay is to be will be considered broken after a sufficiently Long period of 

averaged over (eg. 4) A sampling period is the interval time, should the devices at both ends not be broken. 

between periodic requests for traffic and status values 65 (c) Unc breaks (>2 ends) 

from interfaces (e«. 30 seconds). 'IX*): the mean frame Aline which has more than two ends is treated as a device 

transit delay for device *x'. 'x' must not be broken. from the point of view of bro&KS. 
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